Skip to main content
版本:1.7.0

模拟节点进程相关场景

介绍

kubernetes 节点进程相关场景,同基础资源的进程场景

命令

支持的进程场景命令如下:

参数

除了上述基础场景各自所需的参数外,在 kubernetes 环境下,还支持的参数如下:

参数名说明类型
evict-count限制实验生效的数量int
evict-percent限制实验生效数量的百分比,不包含 %int
labelsPod 资源标签,多个标签之间是或的关系string
namesPod 资源名string
kubeconfigkubeconfig 文件全路径(仅限使用 blade 命令调用时使用)string例: "/root/.kube/config"
waiting-time实验结果等待时间,默认为 20s,参数值要包含单位,例如 10s,1mstring

案例

杀指定 cn-hangzhou.192.168.0.205 节点上 kubelet 进程

yaml配置方式如下

apiVersion: chaosblade.io/v1alpha1
kind: ChaosBlade
metadata:
name: kill-node-process-by-names
spec:
experiments:
- scope: node
target: process
action: kill
desc: "kill node process by names"
matchers:
- name: names
value: ["cn-hangzhou.192.168.0.205"]
- name: process
value: ["redis-server"]

可以看到执行前后,redis-server 的进程号发生改变,说明被杀掉后,又被重新拉起

# ps -ef | grep redis-server
19497 root 2:05 redis-server *:6379

# ps -ef | grep redis-server
31855 root 0:00 redis-server *:6379

通过 kubectl get blade kill-node-process-by-names -o json 可以查看详细的执行结果(下发只截取部分内容)

{
"apiVersion": "v1",
"items": [
{
"apiVersion": "chaosblade.io/v1alpha1",
"kind": "ChaosBlade",
"metadata": {
"finalizers": [
"finalizer.chaosblade.io"
],
"generation": 1,
"name": "kill-node-process-by-names",
"resourceVersion": "9421288",
"selfLink": "/apis/chaosblade.io/v1alpha1/chaosblades/kill-node-process-by-names",
"uid": "24aed084-ff70-11e9-8883-00163e0ad0b3"
},
"status": {
"expStatuses": [
{
"action": "kill",
"resStatuses": [
{
"id": "ebe34959424fb022",
"kind": "node",
"name": "cn-hangzhou.192.168.0.205",
"nodeName": "cn-hangzhou.192.168.0.205",
"state": "Success",
"success": true,
"uid": "e179b30d-df77-11e9-b3be-00163e136d88"
}
],
"scope": "node",
"state": "Success",
"success": true,
"target": "process"
}
],
"phase": "Running"
}
}
],
}

执行以下命令停止实验:

kubectl delete -f kill_node_process_by_names.yaml

或者直接删除 blade 资源:

kubectl delete blade kill-node-process-by-names

blade 执行方式

blade create k8s node-process kill --process redis-server --names cn-hangzhou.192.168.0.205 --kubeconfig config

如果执行失败,会返回详细的错误信息;如果执行成功,会返回实验的 UID:

{"code":200,"success":true,"result":"fc93e5bbe4827d4b"}

可通过以下命令查询实验状态:

blade query k8s create fc93e5bbe4827d4b --kubeconfig config

{"code":200,"success":true,"result":{"uid":"fc93e5bbe4827d4b","success":true,"error":"","statuses":[{"id":"859c56e6850c1c1b","uid":"e179b30d-df77-11e9-b3be-00163e136d88","name":"cn-hangzhou.192.168.0.205","state":"Success","kind":"node","success":true,"nodeName":"cn-hangzhou.192.168.0.205"}]}}

销毁实验:

blade destroy fc93e5bbe4827d4b

常见问题

其他问题参考 blade create k8s 常见问题